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Abstract. In this paper we continue our work on adiabatic time of time- 
inhomogcneous Markov chains first introduced in [9] and Our study is an 
analog to the well-known Quantum Adiabatic (QA) theorem which character- 
izes the quantum adiabatic time for the evolution of a quantum system as a 
result of applying of a series of Hamilton operators, each is a linear combination 
of two given initial and final Hamilton operators, i.e. H(s) = (1 — s)Hrj +sHi . 
Informally, the quantum adiabatic time of a quantum system specifies the 
speed at which the Hamiltonian operators changes so that the ground state 
of the system at any time s will always remain e-close to that induced by the 
Hamilton operator H(s) at time s. Analogously, we derive a sufficient condi- 
tion for the stable adiabatic time of a time-inhomogeneous Markov evolution 
specified by applying a series of transition probability matrices, each is a lin- 
ear combination of two given irreducible and aperiodic transition probability 
matrices, i.e., Pt = (1 — i)Prj + tPi- In particular we show that the stable 
adiabatic time i sa[ j(Po, Pi, e) = O (^i x ( e /2)/e 3 ) , where t m i x denotes the 
maximum mixing time over all Pt for < t < 1. 



1. Introduction 

In this paper we study the stability of time-inhomogeneous Markov chains via 
the notion of stable adiabatic time, an extension of the adiabatic time first intro- 
duced in [5] and [3]. Our study is motivated in part by the well-known Quantum 
Adiabatic (QA) theorem which characterizes the quantum adiabatic time for the 
evolution of a quantum system as a result of applying of a series of Hamiltonian op- 
erators, each is a linear combination of two pre-specified initial and final Hamilton 
operators, i.e., H(s) = (1 — s)Ho + sHi. Quantum adiabatic time of a quantum 
system, to be discussed in detail shortly, specifies the rate at which Hamiltonian 
operators change so that the ground state of the system at any time s will always 
remain e-close to that induced by the Hamilton operator H(s) at time s. The 
first Quantum Adiabatic theorem was stated in the 1920s by M. Born and V.A. 
Fock [5], and have been subsequently studied in [8, among others. Recently, the 
quantum adiabatic time plays an important role in the development of quantum 
adiabatic computing. Specifically, quantum adiabatic algorithms are constructed 
as a sequence of Hamilton operators applied to a quantum system in such a way 
that drives the system to the desirable state or output, see for example [TU]. Thus, 
the quantum adiabatic time is a natural choice for characterizing the running times 
of adiabatic quantum algorithms. 

Analogously, we derive a sufficient condition for the stable adiabatic time of a 
time-inhomogeneous Markov evolution specified by applying a series of transition 



probability matrices, each is a linear combination of two given irreducible and ape- 
riodic transition probability matrices, i.e., Pt = (1 — £)Po + £P±. In particular 
we show that the stable adiabatic time t sa d(Po, Pi, e) = O (tf tlix (e/2)/e 3 ) , where 
tmix denotes the maximum mixing time induced over all the transition probabil- 
ity matrices during the evolution. Recently, the stable adiabatic time of a time- 
inhomogeneous Markov evolution have also found practical applications in network 
design [12] and [FT] . 

To motivate our work, we now provide a short overview of the Quantum Adia- 
batic theorem as discussed in [2J. 

1.1. Quantum Adiabatic Theorem. 

Let Ho and Hi be two given Hamiltonian operators. Let T > be a positive 
integer. For t € [0, T], denote H(s) = (1 — s)Ho + sHi, then H(s) is also a 
Hamiltonian operator dependent on a time parameter s = t/T. In general, a 
Hamiltonian operator described above, is not required to have a finite number of 
physical pure states. In this paper we only consider transition probability matrices 
of finite dimension, which are analogous the Hamiltonian operators with a finite 
number of n physical pure states. 

The ground state in quantum mechanics refers to the lowest-energy state. The 
quantum adiabatic theorem concerns one eigenstate of the energy function, the 
ground state. Here we denote 4>(s) as the ground state of H(s) and we let 7(s) be 
the eigenvalue associated with it. For a given T > 0, when we say that we apply the 
adiabatic evolution given by H and 3? for time T we mean to initialize the system 
in the state <&(0) and then apply the continuously varying Hamiltonian H(i/T) for 
time t G [0,T]. 

Given e > the quantum adiabatic theorem informally says that if, by selecting 
a large enough value of T, we assume that the change in the Hamiltonian happens 
slowly enough, then when we apply the adiabatic evolution given by H(s) and $ 
for time T we will be in an e-ball around $(1) with respect to the ^ 2 (C n )-norm. 
This leads us to the following definition. 

Definition 1. Given e > the quantum adiabatic time, denoted as i ga d(H <&, e), 
is equal to the smallest positive time, T, required to make the application of the 
adiabatic evolution given by H and 3? for time T arrive in an e-ball around $(1) 
with respect to the l 2 (C n )-norm. 

The quantum adiabatic theorem gives a sufficient condition for the quantum adi- 
abatic time. The version of quantum adiabatic theorem as proved in [2J motivated 
our work in [§] and [3]. Supposing that all eigenvalues of H(s) are either smaller 
than j(s) — A or larger than j(s) + A (i.e. there is a spectral gap of A around 
Wi(s)), then [2] tells us that 



In [3] the authors suggest that the quantum adiabatic theorem can be described 
as follows: for an infinitely slow change of the system, i.e., at an infinitely large value 
of T, the probability of a quantum state changing energy levels remains infinitely 
small, even for finite values of s = t/T so that s £ [0,1]. 

While the result in [2] was used to motivate the work in [9] and [3], the latter 
notion in [5] is what motivated the work in this paper. We describe an analogue 
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to this quantum system in the context of the classical time-inhomogeneous Markov 
chains. 

1.2. Main Result . 

Markov chains over a finite number of n states are i 1 (R!?i)-norm preserving pro- 
cesses. We denote || • ||i as the / 1 (R") norm throughout the paper. We assume that 
the reader has a prior understanding of basic types of Markov chains, such as irre- 
ducible, aperiodic, time-homogeneous and time-inhomogeneous Markov chains (see 
[B], [7] and [H]). Throughout the paper we focus on discrete-time Markov chains 
and use || • \\tv to denote the total variation norm. We begin with the definition of 
mixing time. 

Definition 2. For e > the mixing time of a time-homogeneous, irreducible and 
aperiodic Markov chain governed by a probability transition matrix P 7 which has 
unique stationary distribution n, is defined as: 

(1) t mix (P, e) = inf{T e N : ||*aP t - n\\ TV < e} 

over all distributions v . 

Mixing times of time-homogeneous irreducible and aperiodic Markov chains have 
been well studied, see for example [1] and [11]. Mixing time characterizes how 
fast a chain converges to its stationary distribution. It is particularly important 
for bounding the running times of many randomized algorithms, for example the 
simulated annealing algorithm and metropolis-hasting algorithm as explained in 
|13) . There have also been recent studies on mixing times for time-inhomogeneous 
Markov chain [14], [15] and [16] which is more closely related to our work. For 
example, Saloff-Coste and Zuniga [M] consider spectral bounds of the mixing time 
for time-inhomogeneous Markov chains on a finite state space when each step in 
transition corresponds to an ergodic Markov kernel with the same stationary mea- 
sure. In their subsequent work [15] , Saloff-Coste and Zuniga employed spectral 
techniques to obtain asymptotic behavior of time-inhomogeneous Markov chains. 
In this work, the concept of c-stability is introduced, which is an abstraction of 
requiring all Markov kernels to have the same stationary measure, and bounds for 
this kind of stability are obtained. 

Our work, on the other hand, studies a specific class of discrete-time, time- 
inhomogeneous Markov chains that was first constructed in [9] . Specifically, we con- 
sider the probability transition matrices for two discrete-time, time-homogeneous, 
irreducible and aperiodic Markov chains over n states. We denote these matrices 
as Po and Pi throughout the paper and call these the initial and the final transi- 
tion matrices respectively. We define for t £ [0, 1] a class of probability transition 
matrices {Pt}te[o,i] such that 

P t = (l-t)P +tP 1 . 

For t € [0,1] we also define %t to be the stationary distribution of Pt. Given 
T G N, the specific time-inhomogeneous Markov chain being considered in our 
paper is the one such that the probability transition matrix at time fc is Pk for 
< k < T. We consider the class of all time-inhomogeneous Markov chains of this 
type over all T € N. We will say that any Markov chain in this class is governed 
by an adiabatic evolution between Po and Pi. 
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Stability of these kinds of Markov chains were described in [5] and [3J using the 
notion of adiabatic time defined as follows: 



Definition 3. For e > the adiabatic time of a time-inhomogeneous, discrete-time 
Markov chain governed by an adiabatic evolution between Pq and Pi, is defined as: 
(2) 

t ad {P , Pi, e) = inf{T* e N : max ||i/P P i P a • • • Pi-T^ry < e /or T e N,T > T*}, 

V T T 

where v is a probability distribution. 

Next, we recall a result from [S] that compares the adiabatic time of a time- 
inhomogeneous Markov chain and the mixing time of the time- homogeneous Markov 
chain governed by the final transition matrix. 

Theorem 1. 

Given a time-inhomogeneous, discrete-time Markov chain governed by an adiabatic 
evolution between the two irreducible and aperiodic Po and Pi, for e > 



(3) i Qrf (P ,Pi, e ) = O'' <La;(Pl ' e/2) 



In the following Proposition, we provide an upper bound on the adiabatic time 
using the square of the mixing time. The proof of this Proposition is given in 
Section |H 

Proposition 1. Given a time-inhomogeneous, discrete-time Markov chain gov- 
erned by an adiabatic evolution between the two irreducible and aperiodic Po and 
Pi, for e>0 

(4) t ad (P ,Px,e)< 2 ^» (Pl ' e/2) . 

e 

Definition [3] suggests that for e > and any T > £ ad (P ,Pi,e) any probability 
distribution will evolve under consecutive applications of P k to an e-ball around 

T 

7Ti in the space of probability distributions with respect to the total variation norm. 
We desire a stronger notion of stability in this paper to match the description of 
the quantum adiabatic theorem mentioned in [5] . We want to select T large enough 
so that starting at ttq, the distribution will evolve under consecutive applications 
of Pk within an e-corridor of ir k for I < k < T. This leads us to the following 
definition. 

Definition 4. Fore > the stable adiabatic time of a time-inhomogeneous, discrete- 
time Markov chain governed by an adiabatic evolution between the irreducible and 
aperiodic Pq and Pi, written as t sa( j(Po,Pi,£), is defined as follows: 

(5) t sad (P , Pi, e) = inf{T e N : ||7r Pi • • ■ P ^ - it* \\ tv < e for 1 < k < T}. 

The main goal of this paper is finding a bound for the stable adiabatic time with 
respect to the maximum mixing time over all the transition probability matrices. 
For e > we let 

tmix{^) — SUp {t m i x (Ps , e) } 

se[o,i] 

and we seek our bound in terms of this t m i X {e). 
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We divide our result into two main theorems to highlight the nature of this 
bound. Our first theorem gives us insight into the nature of the stable adiabatic 
time. Its proof is given in Section 2J 

Theorem 2. Given a time-inhomogeneous, discrete-time Markov chain governed 
by an adiabatic evolution between the irreducible and aperiodic Po andP 1 and given 
5 e (0, 1]. for any e > 0, 

||7ToP^, ' ' ' Pk — 7Tk \\ T v < e 

for 

2t 2 mix (e/2) 
eS 

and 5 < k/T < 1. 

We now state our main result. The proof is given in Section [3J 

Theorem 3. Given a time-inhomogeneous, discrete-time Markov chain governed 
by an adiabatic evolution between two time-homogeneous, discrete-time, n-state, 
irreducible and aperiodic Markov chains with probability transition matrices Po 
and Pi, for any e > , 

(6) t sad (P ,P 1 ,e)=O 

The remaining sections of the paper are organized as follows: in Section [2] we 
state the necessary tools for the proof of our main theorem, in Section [3] we prove 
our main theorem, and Section [4] is dedicated to proofs. 

2. Preliminaries 

We begin this section with a result on the stability of time-homogeneous Markov 
chains. We will find a lower bound for the mixing time of a time-homogeneous, 
discrete-time, irreducible and aperiodic Markov chain governed by the probability 
transition matrix P in terms of the inverse of the smallest nonzero singular value of 
I — P. There are similar results in [llj . where the lower bound for the mixing time 
of a time-homogeneous, discrete-time, irreducible, aperiodic and reversible Markov 
chain governed by the probability transition matrix P is found in terms of the 
inverse smallest nonzero eigenvalue of I — P, or rather in terms of the relaxation 
time for P. One should note that our work is not limited to reversible Markov 
chains. Our work applies to a much larger class of Markov chains. The proof of the 
following Proposition is in Section |4l 

Proposition 2. For a time-homogeneous, discrete-time, n-state, irreducible and 
aperiodic Markov chain, if we are given e > 0, then if a is the smallest nonzero 
singular value ofl — P, 

(7) ^r^<t mix (P,e). 

Zo~ 

This Proposition will be vital for proving Theorem [3] and it gives us some intu- 
ition about the mixing time. There have been many results bounding the relaxation 
time for reversible Markov chains on weighted graphs for example conductance 
bounds and weighted path upper bounds. In both [T] and [1] the authors introduce 
the necessary spectral structure to find these bounds. They also define a Dirichlet 
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r 4«x(e/2) 



form to help derive the well-known Rayleigh Theorem and the Perron-Frobenius 
Theorem, which also describe bounds on the relaxation time. Our work, however, 
does not employ these techniques directly. 

We now find a bound of ||7roPi • • ■ P k — ttw \\tv hr terms of II 7To — ttilIItv- We 

T T T T 

devote the following Proposition to this endeavor and its proof is in Section 2] 
Proposition 3. For 1 < k < T 



(8) IkoPi ■■■Pw - 7rw|| T v < Utt, -7r || TV + (*±1L. 

Now we can use the continuity of 7r s at s — to find an appropriate bound for 
||ttoP i •• - Pk — 7r h 1 1 tv for < k/T < S. We devote the following Proposition to 
the discovery of how 7r s is continuous at s = 0. The spectral structure of Po is 
crucial to this development. The proof is in Section 2] 

Proposition 4. tt s is continuous with respect to the total variation norm at s = 0. 
In particular, for e > if we let a be the smallest nonzero singular value of I — Po, 
then if 

eo~ 

we have for all s < 5, ||7r s — 7To||tv < £• 

Now we can use Proposition [2] along with the fact that t m i X (Po, e) < t m i X (e) to 
derive the following Corollary to Proposition @] 

Corollary 1. 7r s is continuous with respect to the total variation norm at s = 0. 
In particular, for < e < 4/n if 

nm c _ e(4 - ne) 

[ 1 16n^H mtx (e/2) 
we have for all s < 5, \\tt s — 7to||tv < e /2- 

We now have all the necessary tools to find a bound for the stable adiabatic 
time. We find our result and conclude our paper in the following section. 

3. A Bound for the Stable Adiabatic Time 

We devote this section to finding a bound for the stable adiabatic time entirely 
in terms of the largest mixing time. We state our main result in the following 
theorem. 

Theorem. Given a time-inhomogeneous, discrete-time Markov chain governed by 
adiabatic evolution between the irreducible and aperiodic Pq and Pi, for any e > 0, 



(11) UPo,Pi,e)=0 



4ix(e/2) 



Proof. 

We first provide a sketch of the proof followed by the technical details. Our proof 
is based on the results of Theorem [21 Proposition G2 and Corollary [TJ Specifically, 
we divide our proof into two cases. In the first case, we will show how to select T 
and 5 in order to satisfy the two conditions in Theorem [21 namely: 
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T > 2t 2 mix (e/2) 
eS 

and 

6 < k/T < 1. 

Therefore, by Theorem [21 we have 

iKoPi • • • Pk — TTk \\tV < £■ 
T T T 

However, the selected T is not yet t sa d(Po, Pi, e) since this only holds for k such 
that 

5 < k/T < 1. 

In the second case, we will use the results of Proposition [3] and Corollary Q] to show 
that for the same selected T and S, 

IkoPi • • • Pt — 7T_k \\tV ^ e i 

T T T 

even in the case when 

k/T <6 < 1. 

Therefore, we conclude that the selected T is a sufficient condition for t sa< i(Po, Pi, e). 

We now proceed with the details of the proof, starting with the first case. Let 
e > 0. For this fixed e, we choose T be an integer such that 

< ix (e/2) , < <x (e/2) , 1 



T > 



e 3 e 2 e 

2 / ,~s , x 2 



2^ fa (e/2) , 1_ 
This implies 

v ^ > 2C(e/2) | 1 

Multiplying either side by ^fe and subtracting 1 from either side, we obtain 

2*L*(e/2) 



VeVT - 1 > 

e 

Notice that — 1 > because 

2*m«*(e/2) . Q 
e 

Dividing either side of the above inequality by \fl\fT — 1, we obtain 

1 > 2t 2 mix (e/2) 
~ e^VT- 1 

Multiplying either side by T, we obtain 

T> _2*L-*(e/ 2 ) 



' T Ti 



Now, let 

V T T 



then clearly 

2^(e/2) 
eS 

Next, let k be an integer such that 

S = J < - < 1. 

V T T ~ T ~ 

Then by Theorem^ we conclude that 

IkoPi, ••■Pk -^k|| Ty < e. 

Now in the second (complementary) case, i.e., when k/T < S < 1, we will show 
that for the same selected (5 = — ^, and T, it is still true that: 

UttoPi, •••Pi -TTillrv < e, 
Let /c be an integer such that 

fc /T 1 



fc + 1 AT 



Then, 

k 

~ T ~ V T' 

Using Proposition |31 we have 

IKoP X ■ • • Pi — TTi 1 1 TV < IKi — 7I"oI|tV 
n T T T ii n T ii 



(k+lf 



2T 

t /k + r 1 



T 



ii n e 

= 7Ti _ "0 TV + -• 
T 2 

Next, from Corollary Q] as long as e < 4/n and — y — iQn^n^ie/i) ! 
we have 

IKoPi • • • Pi — TTillxy < e 

T T T 

for _ 

k IT 1 

< - < J . 

~ T ~ V T T 



It should be clear that as e — > 0, 



e 1 e(4-ne) 



when 

T > <ix(e/2) , < fa (e/2) , 1 
e° e e 

This tells us that as e —> 0, 



We conclude that 

t sad (P ,-p 1 ,e) = 0' / '- (</2) 



□ 

We see that this result somewhat reaffirms what has been shown in the Quantum 
Adiabatic Theorem in [2], but a main difference is that the inverse spectral gap 
bound for the quantum system is replaced with a mixing time bound in our result. 
Our result also has an extra multiple of 1/e. Notice that the inverse spectral gap 
was a natural choice for the Quantum Adiabatic Theorem due to the Hamiltonian 
matrix being self-adjoint. For general, not necessarily reversible, Markov Chains, 
the Adiabatic Theorem is expressed using mixing times. 

4. Proofs 

4.1. Proof of Proposition 1. 

From the proof in [9] , we notice that 

£ ad (Po,Pi,e) < Kt mi x(Pi,e/2) 

where 




< e/2. 

After performing some basic algebra and taking the natural logarithm of either side 
of the equation, we see that 

In (1 - e/2) < t mlx (Pi , e/2) Mn ^1 + j^—^j J ~ lj 
= t mlx (Pi, e/2) ({K - 1) In (\ + - A 

= (Pi, e/2) ((K-1) fe-iy+i^L^ 

= t mix (Px, e/2) ^(-^-^Ft) 



/ 1 

= * Mfa (P l) e/2) (^E(-^" +1 -(^T )nj + 1 

It is clear now that if we select K large enough so that 

ln(l - e/2) < -t mix (Pi, e/2) f^(-l)i+i ^L— j 



-t m «(Pi,e/2)ln ( 1 ' 



K-1, 

then K will be large enough to satisfy the previous inequality. 
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Exponentiating either side of the equation and performing the basic algebra required 
to solve for K we see that 

K > 1 



( - ln(l- g /2) \ 



i 

ln(l-e/2)' 



tmix (Pi ! e /2) 



1 



= 1+| -ln(l-e/2) A 1 / -ln(l-e/2) 
I imix (Pi, e/2) ^j! V m «(Pi,e/2) 



l 

j'-i ' 



= 1 + 



WgWg) /y, 1^ / _ln(l- e /2) 
-ln(l-e/2) l^j! \,t mte (Pi,e/2) 



Notice that the infinite sum that we have is the sum of positive terms and the first 
term in the sum is 1. This tells us that 

f /-ln(f-e/2)^" 1 



3 = 

therefore 



- j! V*mix(Pi,e/2) 



'ME 



1 /-ln(l-e/2) 



V*mix(Pi,e/2) 



l 



This tells us that if we select K such that 

K > 1 ^mia; (Pi, e/2) 

" -ln(l-e/2) 
then the above inequality will be satisfied. 

Finally we can expand ln(l — e/2) to find that 

2i mfa (Pi,e/2) / ^ 1 /eV'- 1 



K > 1 + 



Again the infinite sum is the sum of positive terms, and the first term in the sum 
is f . This tells us that 

(oo i 
m 

We conclude that if we select K such that 

> 2f m ia;(Pi, e/2) 
~~ e 

then 




Therefore, we see that 

+ CP P A < 2t lnx( p i> e / 2 ) 

t a d(ro, ±1, e) S ■ 

e 

4.2. Proof of Theorem 2. 

To develop the tools for this theorem, we consider the following treatment of our 
probability transition matrices. If we are given s £ (0, 1], then we see that 

t\ t 

1 - - P + -P s 

SI s 



Defining P^ = P s t, we see that 



for all t € [0, si 

(s 

t 

p[ s) = (1 - t)p s) + tp« 

for all t £ [0, 1]. We also define 7r t (s) = 7r st . 

We see that {P[ s) }te [ 0j i] is a class of probability transition matrices where Po = 
P S) and P s = P^ s) . 

Since the time-homogeneous Markov chains determined by Po and P s are irre- 
ducible and aperiodic, we can consider a time-inhomogeneous, discrete-time Markov 
chain governed by adiabatic evolution between these two time-homogeneous Markov 
chains. We can apply Theorem Q] to show that 

M pWpw e)=0 f fL^wa Y 



Now let e > and S <G (0, 1] 



For s € [<5, 1] we have that T* = t a d(P s ' ) , P^, e) is the adiabatic time between P S ^ 



and P ± 



l ■ 

This tells us that 



nuucUi/P^pW ---P^ -tt^Wtv <e. 
Because ttq^ is a specific distribution, we have that 



T* 

= IKqWp^ P^ • • • pW /B) - ^ s) || T v 

(T*/ = ) (T*/ = ) (T*/s) 

= ||""0P 1 P 2 ' ' ' P s(T*/s) - 7T s(T*/s) \\tV 

(T«/s) (T'/B) (T*/*) (T'/s) 

Clearly if T = t ad (P { °\P { °\e)/ S , then 

||7rnPiP_2 ■ • • PsT — 7TsT Wtv < e. 

T T T T 

We showed in Proposition Q] that for e > 

f rP (B) P (S) ,N <^ g^jaCg^j/g) 
Cad(,-^o i e )^ " ' 
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It follows that for e > 



+ (T»(s) p(s) x < 2t 2 rmx( e / 2 ) 



For e > if we let T any integer such that 

2t 2 mix (e/2) 
e5 

we have 

||7ToP 1.P 2. • • Pk — TTk \\tV < e 

for all 5<k/T< 1. 



4.3. Proof of Proposition 2. 

We know that an irreducible, aperiodic time-homogeneous Markov chain governed 
by a probability transition matrix P has a unique stationary distribution, making 
the nullity of (IA — P) equal to one when A — 1. This would necessarily imply that 
the rank of (I — P) is n — 1. 

Let u\ > ■ ■ ■ > (7 n -i — o be the positive singular values of (I — P) with respect 
to the Euclidean inner product, which we will denote || • ||2 throughout this paper. 
This implies that there exists an orthonormal basis {vi, • • • , v n } such that vj(I — 
P)(I - P) T = cr]vj for 1 < j < n - 1 and v„(I - P)(I - P) T = 0. 
Clearly v„ = 7r/||7r|| 2 . 

For t e N define M t _i = I + P + P 2 H h P t_1 . 

Also define n to be the stationary distribution of P. 
Notice that I — P* = (I — P)M t _i. 

For irreducible, aperiodic Markov chains we have that if Ai , • • • A„ are the eigenval- 
ues of P such that 1 = Ai > | A2 1 > ■ ■ ■ > |A n |, then 

1_A| 1-A^ 
' 1 - A 2 ' ' 1 - A„ 

are the eigenvalues of M t -i. Notice that Mt-i must be invertible because all 
eigenvalues are nonzero and also notice that t is the largest eigenvalue in modulus. 

This implies that I - P = (I - P'JMt-r 1 and wc sec that 

o = ||v„_i(I - P)|| 2 = ||v„_i(I - P^Mt-i" 1 !^. 
We see that if || • ||* is the standard matrix norm, then 

||v n _i(I - P*)|| 2 = ||v„_i(I - P^Mt-i-^t-il^ 

< Hvn-iOI-P^Mt-i-^lallMt-!!!. 

^illVn.^I-P^Mt.!" 1 !!, 

= to. 
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If we let u be a vector such that for 1 < i < n, u(i) = whenever v n _i(i) > 
and u(i) = — v n _i(i) whenever v n _i(i) < 0, then we have that = u/||u||i and 
^2 = (v n -i + u)/||v n _i + u||i are probability distributions and 



v n _i(I - P*) = v n _! + (||u||i - ||v„_i + u||i)tt 

- (||V„_1 + U||l (l* - 7T) P* - ||u||l (^1 - 7T) P<) . 

For x, y € R™ such that x and y are probability measures, we see that 

^||x-y|| 2 <||x-y|| Ty <^||x-y|| 2 . 
Through the triangle inequality we see that if we select t = t mix (P 7 e), then 

|| (||V„_1 + U||l {lS 2 - 7T)P* - || U ||l fa - 7T) P*) || 2 < ||V„_1 + U||i • || {v 2 - 7r) P*|| 2 

+ Hull! .||(^-7r)P*|| 2 
>/n||vn-i+u||i-||(i^-7r) P*||rv 



< 

+ 



2 

\HH|i ■ II {vi -7r)P*|| T y 
2 

V^(||v„-i+u||i + ||u||i)e 



< 



2 

\/nj|vn_iJ|ie 
2 

njjvn-ijlge 
2 



2 ' 

Because v n _i and it are orthogonal, we see that 

||V„_1 + (HUD! - HVn-! - U||l)7r|| 2 = ^1 + (||u|| \ \\^ + u|| i) 2 (||7r|| 2 ) 2 

> 1. 

Now through the reverse triangle inequality, meaning that for vectors x and y, 
||x- y|| 2 > ||x|| 2 - ||y|| 2 , we see that if t = t mix (P,e), then 

HVn-xOI-P^^l-^. 

This now implies that 

la 

4.4. Proof of Proposition 3. 

Because 7r P j_ = itq + 7p7r (Pi — Po) for 1 < j < fc we notice that 

T 

7T P J ■ ■ ■ P k - 7Tk = TToPj + l • • • P k - 7Tk + ^7T (Pl - P )Pj + l -Pi 
X T T T X X J_ "T" T 

for 1 < j < k — 1 and 

k 

7T P k - 7T k = (n - 7Tk ) + — 7T (Pl - P ). 
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Using the convention Pj+i 

7T Pi • • • P k — TTk 
T T T 

Taking the total variation norm to either side of the inequality, using the triangle 
inequality and pulling out constants, we see that 

k 

IkoPi, ■••Pi - TTkWtV = 11(710 -TTk) +^-7T (Pl -P )Pj + i •• - Pk || TV 

fc 

< Iko - 7Tk\\ TV + Y, |lKo(Pi - Po)Pm • • • P* ||xv. 

3=1 

Notice that for 1 < j < k — 1 

7T (Pl - P )Pj + l P k = TToPlPj + l P k - TToPoPj + 1 P k 

T T T T ~T~ T 

is the difference between two probability distributions and 

7T (Pl -P ) =7T P 1 -TToPo 

is also the difference between two probability distributions. 

Because we are taking the total variation norm to the difference of two probability 
distributions we see that || • \\tv = h\\ ■ 111 where || • ||i is the ii-norm. 

We have that for probability distributions \i and i/, — v\\tv = — Hli -\ 

|(||mIIi + IIHIi)<i- 

This tells us that ||vr (Pi - P )Pj±i ■ ■•P*||xv < 1 for 1 < j < fc. 
We see then that 

II TToPi ' ' ' P k — 7Tk 1 1 TV < 1 1 7T k 
II T T T T 

= ||7T_k 
T 

= H 

< Ik* 

4.5. Proof of Proposition 4. 

We begin with the creation of an orthonormal basis of eigenvectors associated with 
(I — Po)(I — Po) T by a singular value decomposition similar to the process we 
mentioned in Proposition [2j 

Here we let ax > • • • > <r n _i = a be the positive singular values of (I — Po) with 
respect to the Euclidean inner product. This implies that there exists an orthonor- 
mal basis {vi, • • • , v n } such that vj(I — Po)(I — Po) T = crfvj for 1 < j < n — 1 

14 



■ ■ ■ Pk = I when j > fc, we would see that 

fc 

= (tTq - TTk ) + V |;7r (Pl " P )Pi±i P*. 

3=1 



ttoIItv + Yj 

3=1 



J_ 

T 



kqWtv 

ttoIItv 
ttoIItv 



1 k 

3=1 
fc(fc+ 1) 

2T 
(fc + l) 2 
2T 



and v n (I-P )(I-P ) T = 0. 
Here v n = tt / 1 1 tto 1 1 2 • 

To show continuity at s = let e > and first notice that for any s e [0,1], 
( 7 r s -7r )(I-Po) = S7r s (P 1 -Po). 

Using the Euclidean norm, we see that if Po 7^ Pi and s^O, then 
||(7r B -7r )(I-Po)|| 2 _ J|7r g (P 1 -P )| 2 



||7r s -7To||2 ||7Ts-7ro||2 

Throughout this proof we will use < -, • > as the Euclidean inner product. 
For 1 < j < n let Cj =< ir s — n , Vj >. Then we see that tt s — ir = Ej=i c j v j- 
We have that 

||(7r 8 -7r )(I-Po)||i < (7r.-7r )(I-Po),(7r.-7r )(I-Po) > 



Ks-TTolU < 7T S - 7To,7r s - 7T > 

= < ~ 7T , (tT s - 7To)(I - Pp)(I - Pp) T > 
< 7T S — 7T , 7T S — 7T > 



< Ej=iCjyj,2jj=iCjVj > 

cr 2 c 2 



>(J 2 U = l C l 



'n-1 sr^ n 



En 
.7=1 1 



c 2 



= O, 



" _1 1 E" =1 c 2 



< 7T S - 7T , V n > 



||7Ts ^ 7To||2 

If we let w(s) = (?t s — 7To)/||?t s — 7To 1 1 2 then we see that 

2 (, < 2 ||7r s (P 1 -P )|| 2 

0-„_! I 1 - (< w(s),V„ >) ) < S — . 

Because w(s) and v n are unit vectors, we can use the fact that 

||w( S )|| 2 - 2 < w(«), v n > +||v n || 2 = ||w( S ) - v n || 2 

to show that 

l-<w(s),v„ >= -||w(s) - v n || 2 
and we can use the fact that 

||w( S )|| 2 + 2 < w(«), v n > +||v n || 2 = ||w( S ) + v n || 2 

to show that 

1+ < w(s),v n >= -||w(s) + v n || 2 . 
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From this we see that 1 — (< w(s), v n >) = ||w(s) — v n |||- ||w(s)+v n ||2/4. Plugging 
this into our previous equation, we can see that 

%||wW - v n ||| • ||w( S ) + v n ||l < ^f'-ff 1 - 

After performing some basic algebra we see that 

2a||7r.(Pi -P )|| 2 



|7T S - 7ro||2 < 



(T„_l||w(s) - V„|| 2 ' ||w(s) + V n 1 1 2 ' 



Notice that < w(s), 1 > / \fn = and < v n , 1 > / y/n = 1/ (v^lKolh) for ah 
s G [0, 1]. Because these are the scalar components of the projections of w(s) and 
v n onto 1 respectively, we see that the minimum possible value for ||w(s) — v„ || 2 
and ||w(s) + v n || 2 is at least 1/ (v^lKolh) . 

We now have that 

.. .. ^ 2sn|Ko||M|7r B (P 1 -P )|| 2 

IFs — 7TO || 2 S 

cr n -i 

< 2sn|K B (Pi -P )|| 2 

C n -1 

= 2gnK(Pi-Po)jj2 

(7 

Again for x, y G R™ such that x and y are probability measures, we see that 
I||x-y|| 2 <||x-y|| Ty <^||x-y|| 2 . 

This will imply that 

2 S n 3 / 2 ||7r s (P 1 -P )||TV 



O'n-l 



Because 7r s (Pi — Po) = 7r s Pi — 7r s Po is the difference of two probability distri- 
butions, we see that || • \\tv = 5II ■ ||i where || • ||i is the Zi-norm. This implies 
that 

IMP! - P )||XV = \hs?l - 7T s Po||l < \ (||7r.Pl||l + |ksPo||l) < 1. 

This shows that 

2sn 3 / 2 

Fs — TO TV < • 

a 

Clearly if e > 0, then 

s < = 



2n 3/2 

implies ||7r s — 7r |jTV < £■ 



This shows that 7r s is continuous at s = 0. 
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